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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Anderson, David J. 

Saito, Tetsuichiro 

5 (ii) TITLE OF INVENTION: A NOVEL HOMEODOMAIN PROTEIN 

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

10 (B) STREET: Four Embarcadero Center, Suite 3400 

(C) CITY: San Francisco 

(D) STATE : California 

(E) COUNTRY: United States 

(F) ZIP: 94111 

15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

20 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/701,278 

(B) FILING DATE: 22-AUG-1996 

(C) CLASSIFICATION: 
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(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Silva, Robin M. 

(B) REGISTRATION NUMBER: 38,304 

(C) REFERENCE/DOCKET NUMBER: A-63770-1 



(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (415) 781-1989 
30 (B) TELEFAX: (415) 398-3249 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCAGAGGTAG GCAGGGTTCC CGAGCCGCTC TCCCGGCTCC CTGCTCTGGG CCTTGGGGCT 60 



4 0 CCACCGGCTT CTTGGCCCGA GCTGCTGCGC GTGCAGATGG CCTTGCGCGA TCGCCGGACC 12 0 
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CCGCTGCGGT GGCCAAGTGC AGGGCTTGTG GCTGGGACCC CTGAGAACCA GGAGCCAGAC 180 
TGTGCTCAGC TTGCCAGGCC GGAGCCACGC ACGGGCACAA GTCTGTCAGG CCGCCATCAG 240 
TCCTGGTCCA GCCGTCAGGG CCCATCCGAC CGTCGGCGAT GTTTTATTTC CACTGCCCGC 3 00 
CACAGCTAGA GGGCACAGCG CCTTTTGGTA ACCACTCTAC GGGGGATTTT GATGATGGGT 360 
5 TTCTTAGAAG AAAACAGCGC AGAAATCGGA CAACCTTCGC TCTTCAGCAG TTGGAAGCTC 420 
TGGAGGCAGT CTTTGCCCAA ACACACTACC CAGATGTCTT CACCAGAGAA GAGCTAGCCA 480 
TGAAAATAAA CCTCACAGAA GCCAGAGTGC AGGTTTGGTT CCAGAAC CGA AGAGCCAAGT 54 0 
GGAGGAAGAC AGAGAGAGGG GCCTCTGACC AGGAACCAGG GGCTAAGGAA CCCATGGCAG 600 
AGGTGACACC ACCCCCAGTG AGGAACATCA ACTCTCCACC CCCAGGGGAC CAGGCCCGGG 660 

10 GCAAGAAGGA GGCCCTGGAG GCCCAGCAGA GCCTGGGACG CACAGTGGGC CCCGCCGGGC 72 0 
CTTTCTTCCC CTCCTGCTTG CCAGGGACCC TCCTGAACAC AGCCACTTAT GCCCAGGCCC 780 
TGTCCCATGT GGCATCTCTG AAAGGGGGCC CACTGTGCTC TTGCTGCGTC CCAGACCCTA 840 
TGGGGCTCTC CTTCCTCCCC ACTTACGGTT GCCAGAGTAA CCGCACAGCC AGCGTGGCTG 900 
CCCTGCGCAT GAAGGCCCGC GAGCATTCAG AAGCGGTCCT GCAGTCTGCC AACCTTCTGC 960 

15 CGTCCACCAG CAGCAGCCCC GGCCCTGCCT CCAAGCAGGT GCCTCCAGAA GGCAGCCAGG 1020 



lGCCCTC CCCAACGAAG GAACAGAGCG AGGGAGAGAA GAGCGTATGA GGGTCCGGAG 1080 



AACCCAGCTG GGAGCCCTGC CCACCCCTGC TTCTCTCAGC CTCAGCCCTG CCAGCCTCTG 114 0 
AACCACAAGG AGTAGCCACC TCCTCATGGA TCTGACAGGG CAAACGGGAC CTGCAAGCTG 12 00 
GTTGAGACCT GAAGAGTCCC TCTAGAATTC TGCTGGTAGG CTGTGTTGTT CTCGCTTTTC 1260 

20 CTTTGGTGAC ATTTTCCGAT GGCTCTTAGT GACTCTGGAC ACTGCTCTGT GATGAGGTCC 132 0 
CTGTTTTTTG CTTTTTGTTT TGTCTCTTTT TTTTTGTTTT GTTTTGTTTT ATTTTCCAGG 1380 
CCAAGCAGCC TTGGAGCAAA GCAGATTAGT TTATTCCACC ATCCTTCTTG AGATATCTGG 1440 
GAAGGTCTTG TCAATTCCAA GGACTGTGGC AAGGATCATC CGTGAAAGAT GCCAAGAAGT 1500 
GACATCTCAT GACAGGAAAT GAGACGGGCA CTCCCATATT GCTTAAGAAC CACAGAACTG 1560 

25 GTGGACTATC AGCCAGTTCT CACTCCCTTC AGCCAGGACT GGCATCGGCC TCCTTTGTCT 1620 
TGTTTAAAGG AATTAGCTGA GGTTTTGGCT AGGAAGTGAC AAGATATGGG CTGAAGACAT 1680 
TGTGGTCCTG ACCCTAGCAG ATCTCCCTGG GCACATCTGA CCTGGTCCAG TCAGGCAGGT 1740 
TGTCAGTTCG GGGATGGGGG CTGCTCTGCT GATTCTGTGT GTGGGTTCCC TGCAATTAGA 1800 
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GTGTTCACTT GCAGGCCCCG CTCTCTTCAG AAGAGTGATG GGAAGTTCAC CAATCAGAAT 1860 

GTAGCTTTGT AGCCCAGGAA AGGACCAGAG TCCTTGAAGC GGTAGGAAAT CCCTAGGAAG 1920 

GCCCCTTAAA TACTTATGCC CAGATGAGCT GCCCTTCTTC CTATCCCCGT ATGTCGAGAG 1980 

GTTGACGAGA CAGGAAAGCC AGGAAGATGA CTCCGTGTGG CAGAAGAGAA TGGAGTCCAA 2040 

5 AGGGCCAACT TTCACAGAGA TTTCTGCCGC AGTTTAGCGT GGCTGTGTTC TTTCACGCGA 2100 

TGGTGACTTC GGAGAGATCA GAGGGAGATG TGCAATAGCA TGAGCCCCGC TCCTGGCCCG 2160 

GGTCCTGGAA AGGTTGTGGT TGTTTGGTGG CTTTGGCTGA TGATGTTTCC ACGCAAACAG 2220 

ATATTGCTTT CATGATGGCT GTTCTCATTT CAGTTCTGAT AATCGAGACG CTGTGCTCCC 2280 

AGGCGCTCTG CCTCCCCTTA ACTCTTCAGG AGCACCCCCT CCCCTGTAAT ACTCCTAAGT 2340 

10 GTATCGTGCC TCACTTACGG TTACTGCAAC ACATTTGATG GAACACACTG TCTCCTTTAA 2400 

AAAAGAAAAA AAAAAAAAAA AAAA 2424 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



20 Met Phe Tyr Phe His Cys Pro Pro Gin Leu Glu Gly Thr Ala Pro Phe 

15 10 15 

Gly Asn His Ser Thr Gly Asp Phe Asp Asp Gly Phe Leu Arg Arg Lys 
20 25 30 

Gin Arg Arg Asn Arg Thr Thr Phe Ala Leu Gin Gin Leu Glu Ala Leu 
25 35 40 45 

Glu Ala Val Phe Ala Gin Thr His Tyr Pro Asp Val Phe Thr Arg Glu 
50 55 60 

Glu Leu Ala Met Lys lie Asn Leu Thr Glu Ala Arg Val Gin Val Trp 
65 70 75 80 

3 0 Phe Gin Asn Arg Arg Ala Lys Trp Arg Lys Thr Glu Arg Gly Ala Ser 

85 90 95 

Asp Gin Glu Pro Gly Ala Lys Glu Pro Met Ala Glu Val Thr Pro Pro 
100 105 110 



Pro Val Arg Asn lie Asn Ser Pro Pro Pro Gly Asp Gin Ala Arg Gly 
115 120 125 



Lys Lys Glu Ala Leu Glu Ala Gin Gin Ser Leu Gly Arg Thr Val Gly 
130 135 140 

Pro Ala Gly Pro Phe Phe Pro Ser Cys Leu Pro Gly Thr Leu Leu Asn 
145 150 155 160 

Thr Ala Thr Tyr Ala Gin Ala Leu Ser His Val Ala Ser Leu Lys Gly 
165 170 175 

Gly Pro Leu Cys Ser Cys Cys Val Pro Asp Pro Met Gly Leu Ser Phe 
180 185 190 

Leu Pro Thr Tyr Gly Cys Gin Ser Asn Arg Thr Ala Ser Val Ala Ala 
195 200 205 

Leu Arg Met Lys Ala Arg Glu His Ser Glu Ala Val Leu Gin Ser Ala 
210 215 220 

Asn Leu Leu Pro Ser Thr Ser Ser Ser Pro Gly Pro Ala Ser Lys Gin 
225 230 235 240 

Val Pro Pro Glu Gly Ser Gin Asp Lys Pro Ser Pro Thr Lys Glu Gin 
245 250 255 

Ser Glu Gly Glu Lys Ser Val 
260 

INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Gly Phe Leu Arg Arg Lys Gin Arg Arg Asn Arg Thr Thr Phe Ala Leu 
15 10 15 

Gin Gin Leu Glu Ala Leu Glu Ala Val Phe Ala Gin Thr His Tyr Pro 
20 25 30 

Asp Val Phe Thr Arg Glu Glu Leu Ala Met Lys lie Asn Leu Thr Glu 
35 40 45 

Ala Arg Val Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Trp Arg Lys 
50 55 60 



Thr Glu Arg Gly Ala Ser 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Leu His Glu Lys Arg Lys Gin Arg Arg He Arg Thr Thr Phe Thr Ser 
10 1 5 10 15 

Ala Gin Leu Lys Glu Leu Glu Arg Val Phe Ala Glu Thr His Tyr Pro 
20 25 30 

Asp He Tyr Thr Arg Glu Glu Leu Ala Leu Lys He Asp Leu Thr Glu 
35 40 45 

15 Ala Arg Val Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Phe Arg Lys 

50 55 60 

Gin Glu Arg Ala Ala Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO : 5 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Leu Pro Leu Lys Arg Lys Gin Arg Arg Ser Arg Thr Thr Phe Thr Ala 
15 10 15 

Glu Gin Leu Glu Glu Leu Glu Arg Ala Phe Glu Arg Thr His Tyr Pro 
30 20 25 30 

Asp He Tyr Thr Arg Glu Glu Leu Ala Gin Arg Ala Lys Leu Thr Glu 
35 40 45 

Ala Arg Val Gin Val Trp Phe Ser Asn Arg Arg Ala Arg Trp Arg Lys 
50 55 60 
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Gin Ala Gly Ala Asn Gin 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asn Val Ser Ser Ser Lys Lys Arg Arg His Arg Thr Thr Phe Thr Ser 
10 1 5 10 15 

Leu Gin Leu Glu Glu Leu Glu Lys Val Phe Gin Lys Thr His Tyr Pro 

20 ' 25 30 

Asp Val Tyr Val Arg Glu Gin Leu Ala Leu Arg Thr Glu Leu Thr Glu 
35 40 45 

15 Ala Arg Val Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Trp Arg Lys 

50 55 60 

Arg Glu Arg Tyr Gly Gin 
65 70 

(2) INFORMATION FOR SEQ ID NO: 7: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Glu Lys Lys Lys Arg Lys Gin Arg Arg Asn Arg Thr Thr Phe Asn Ser 
15 10 15 

Ser Gin Leu Gin Ala Leu Glu Arg Val Phe Glu Arg Thr His Tyr Pro 
30 20 25 30 

Asp Ala Phe Val Arg Glu Asp Leu Ala Arg Arg Val Asn Leu Thr Glu 
35 40 45 

Ala Arg Val Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Phe Arg Arg 
50 55 60 

35 Asn Glu Arg Ala Met Leu 

65 70 
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(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Thr Lys Arg Lys Lys Lys Gin Arg Arg Asn Arg Thr Thr Phe Asn Ser 
10 1 5 10 15 

Ser Gin Leu Gin Ala Leu Glu Arg Val Phe Glu Arg Thr His Tyr Pro 
20 25 30 

Asp Ala Phe Val Arg Glu Glu Leu Ala Arg Arg Val Asn Leu Ser Glu 
35 40 45 

15 Ala Arg Val Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Phe Arg Arg 

50 55 60 

Asn Glu Arg Ala Met Leu 
65 70 



(2) INFORMATION FOR SEQ ID NO: 9: 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Leu Gin Leu Lys Arg Lys Leu Gin Arg Asn Arg Thr Ser Phe Thr Gin 
15 10 15 

Glu Gin lie Glu Ala Leu Glu Lys Glu Phe Glu Arg Thr His Tyr Pro 
30 20 25 30 

Asp Val Phe Ala Arg Glu Arg Leu Ala Ala Lys lie Asp Leu Pro Glu 
35 40 45 

Ala Arg lie Gin Val Trp Phe Ser Asn Arg Arg Ala Lys Trp Arg Arg 
50 55 60 



35 Glu Glu Lys Leu Arg Asn 

65 70 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Tyr Ala Pro Lys Arg Lys Gin Arg Arg Tyr Arg Thr Thr Phe Thr Ser 
15 10 15 

Phe Gin Leu Glu Glu Leu Glu Lys Ala Phe Ser Arg Thr His Tyr Pro 
20 25 30 

Asp Val Phe Thr Arg Glu Glu Leu Ala Met Lys lie Gly Leu Thr Glu 
35 40 45 

Ala Arg lie Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Trp Arg Lys 
50 55 60 

Gin Glu Lys Val Gly Pro 
65 70 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

He Ala Leu Lys Arg Lys Gin Arg Arg Cys Arg Thr Thr Phe Ser Ala 
15 10 15 

Ser Gin Leu Asp Glu Leu Glu Arg Ala Phe Glu Arg Thr Gin Tyr Pro 
20 25 30 

Asp He Tyr Thr Arg Glu Glu Leu Ala Gin Arg Thr Asn Leu Thr Glu 
35 40 45 

Ala Arg He Gin Val Trp Phe Ser Asn Arg Arg Ala Arg Leu Arg Lys 
50 55 60 



Gin His Thr Ser Val Ser 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Pro Asn Gly lie Lys Lys Lys Lys Thr Arg Thr Thr Phe Thr Ala 
10 1 5 10 15 

Tyr Gin Leu Glu Glu Leu Glu Arg Ala Phe Glu Arg Ala Pro Tyr Pro 
20 25 30 

Asp Val Phe Ala Arg Glu Glu Leu Ala lie Lys Leu Asn Leu Ser Glu 
35 40 45 



15 Ser Arg Val Gin Val Trp Phe Gin Asn Arg Arg Ala Lys Trp Arg Lys 

50 55 60 

His Glu Pro Pro Arg Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 13: 

* (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Glu Ser Ala Ala Lys Arg Arg Arg Thr Arg Thr Asn Phe Ser Gly 
15 10 15 




Trp Gin Leu Glu Glu Leu Glu Ser 
20 

Asp Val Phe Met Arg Glu Ala Leu 
35 40 

Ser Arg Val Gin Val Trp Phe Gin 
50 55 



Ala Phe Glu Ala Ser His Tyr Pro 
25 30 

Ala Met Arg Leu Asp Leu Leu Glu 
45 

Asn Arg Arg Ala Lys Trp Arg Lys 
60 
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Arg Glu Gin Asn Arg Asn 
65 70 



s 

i 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Phe Thr Ala Tyr Gin Leu Glu 
10 1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gin Val Trp Phe Gin Asn Arg 
20 1 5 



INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 
30 (B) LOCATION: 14.. 15 

(D) OTHER INFORMATION: /note= "The ' N 1 appearing at 
position 14 represents inosine." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
35 (B) LOCATION: 17 . . 18 

(D) OTHER INFORMATION: /note= "The f N' appearing at 
position 17 represents inosine." 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 26.. 27 

(D) OTHER INFORMATION: /note= "The 'N» appearing at 
5 location 26 represents inosine." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

CGGGATCCTT YACNGCNTAY CARYTNGA 28 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 25.. 26 

(D) OTHER INFORMATION: /note= "The f N' appearing at 
position 25 represents inosine." 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAATTCKR TTYTGRAACC ANACYTG 27 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
30 TCATGAAGTG TGACGTTGAC ATCC 24 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

3 5 (C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



48 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GTAAAACGCA GCTCAGTAAC AGTC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
ACTGAAGCCA AGGTAGGATC CG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CGGATCCTAC CTTGGCTTCA GTAG 
((2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CTACTGAAGC CAAGGTAGGA TCCG 



